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^5 ' Abstract 

O \ We introduce and study two new concepts which are essential for the quantitative 

' analysis of the statistical quality of the available galaxy samples. These are the dilu- 

■ tion effect and the small scale fluctuations. We show that the various data that are 

considered as pointing to a homogenous distribution are all affected by these spuri- 
ous effects and their interpretation should be completely changed. In particular, we 



5_j ■ show that finite size effects strongly affect the determination of the galaxy number 



counts, namely the number versus magnitude relation (A^(< m)) as computed from 
the origin. When one computes A^(< m) averaged over all the points of a redshift 
survey one observes an exponent a = D/5 ~ 0.4 compatible with the fractal di- 
mension L) ~ 2 derived from the full correlation analysis. Instead the observation 
of an exponent a ~ 0.6 at relatively small scales, where the distribution is certainly 
not homogeneous, is shown to be related to finite size effects. We conclude there- 
fore that the observed counts correspond to a fractal distribution with dimension 
Z) ~ 2 in the entire range 12 < m < 28, that is to say the largest scales ever probed 
for luminous matter. In addition our results permit to clarify various problems of 
the angular catalogs, and to show their compatibility with the fractal behavior. We 
consider also the distribution of Radio-galaxies, Quasars and 7 ray burst, and we 
show their compatibility with a fractal structure with D ~ 1.6 1.8. Finally we 
have established a quantitative criterion that allows us to define and predict the 
statistical validity of a galaxy catalog (angular or three dimensional). 
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1 Introduction 



The crucial question we are going to discuss in detail is the definition of the 
minimal statistics that is necessary to characterize correctly the large scale 
distribution of matter. The clarification and the definition of this fundamental 
point will allow us to reinterpret and clarify the confiictual situation arising 
from observations of different nature. 



The best information on galaxy distribution would be to know the position 
in space of all the galaxies. The data that are closest to this ideal situation 
are the so-called "volume-limited" (VL) samples The VL samples can 

be extracted from a redshift survey that contains information on the three 
spatial coordinates (angular position plus redshift). Such samples avoid the 
luminosity selection effect related to the observational point: one defines a 
maximum depth and includes in the sample only these galaxies that would be 
visible from any point of this volume. In such a way this VL sample is observer 
independent and its correlation properties can be studied directly. 



The galaxy distribution in all the redshift catalogues currently available 0- 
||T7| show a highly inhomogeneous distribution characterized by the presence 
of clusters, superclusters, filaments walls and large voids up to the boundaries 
of such surveys. 



The modern statistical analysis of these three dimensional samples |TB| show 



that this irregular distribution is characterized by having long range frac- 
tal correlations. In particular Coleman & Pietronero 0, found that the 



galaxy distribution in the CfAl survey [^] is fractal with D f» 1.6 up to 
Rs ~ 20h~^Mpc. This depth is of the order of the radius of the maximum 
sphere that is fully contained in the sample volume. This limit is imposed by 
the request that the correlation function should be computed only in spher- 
ical shell (0). There have been attempts to push Rs to larger values by us- 
ing various weighting schemes for the treatment of boundary conditions ||20[ . 
These methods however, unavoidably introduce artificial homogenization ef- 
fects and therefore should be avoided 0. Sylos Labini et al. found that 
the Perseus-Pisces galaxy redshift surveys has fractal long-range correla- 
tions up to 30/;,~^Mpc with dimension D ^ 2. The fractal dimension in this 
case is somewhat larger than in CfAl. This tendency for a higher value of 
the fractal dimension is related to the poor statistics and small depth of the 
CfAl catalogue and it is confirmed by various other redshift surveys. For 
example a similar behavior has been found by Di Nella et and 
in the LEDA database up to ~ 150h'^Mpc. A first apparent contradic- 
tory result has been found by Strauss et a/.[|2^ (see also |^) analyzing the 
IRAS catalogues, where they found an evidence towards homogenization at 
Rs^lb^ 20h-^Mpc. 
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Fig. 1. The galaxy number counts in the -B-band, from several surveys. In the range 
12 < m < 19 the counts show an exponent a ~ 0.6, while in the range 19 ^ m < 28 
the exponent is a ~ 0.4. 

A different way to get information for larger scales is to compute the condi- 
tional average (in VL samples) from the observation point only. This allows 
us to extend Rs to the full depth of the catalogue, at the expenses of having a 
reduced statistics. We are going to see that in this way it is possible to detect 
fractal correlations up to 130h~^ Mpc in Perseus-Pisces (PP) |^ and up to 
800 ^ 900h-^Mpc in the new ESP catalogue 0]. 



Historically |]2^ |^ the oldest type of data about galaxy distribution is given 
by the relation between the number of observed galaxies A^(> /) and their 



apparent brightness /. It is easy to show that p8 



N{> f) ~ / 



_D 

" 2 



where D is the fractal dimension of the galaxy distribution. Usually this re- 
lation is written in terms of the apparent magnitude / ~ XO""'^"* (note that 
bright galaxies correspond to small m) . In terms of m, Eq.[^ becomes 

logA^(< m) ~ am (2) 



with a = D/^ ||28|. Note that a is the coefficient of the exponential 
behavior of Eq.^ and we will call it "exponent" even though it should not 
be confused with the exponents of power law behaviors. In Fig.|l] we have 
collected all the recent observations of A^(< m) versus m in the B- 

spectral-band (m^). At bright and intermediate magnitude ( 12 < ms ^ 18), 
corresponding to small redshift ( z < 0.2), one obtains a ~ 0.6, while from 
me ~ 19 up to ~ 28 the counts are well fitted by a smaller exponent with 

4^ is that a ~ 0.6 corresponds to 



28 , 36 



a ~ 0.4. The usual interpretation 
D ^ 3 consistent with homogeneity, while at large scales galaxy evolution and 
space time expansion effects are invoked to explain the lower value a ~ 0.4. 
On the basis of the previous discussion of the VL samples we can see that 
this interpretation is untenable. In fact, we know for sure that, at least up 
to Rg ~ 150h~^Mpc there are fractal correlations M, [pil], P2| so one would 
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eventually expect the opposite behaviour. Namely small value of a ~ 0.4 
(consistent with D ^ 2) at small scales followed by a crossover to an eventual 
homogeneous distribution at large scales {a ~ 0.6 and D ^ 3). 



The GNC in the (red) i?-band shows an exponent a ~ 0.37 — 0.41 in the range 
20 < i? < 23 13,11 and |5l. Moreover Gardner et al. [|| have studied the 
GNC in the (infrared) iC-band in the range 12 < < 23, and they show that 
the slope of the counts changes at i^' ~ 17 from 0.67 to 0.26 (see also 
?B|). Djorgovski et al. found that the slope of the GNC is little bit higher 
than |^3[, i.e. a = 0.315 ±0.02 between = 20 and 24 magn. The situation is 
therefore quite similar in the different spectral bands. The puzzling behaviour 
of the GNC represents the second contradiction we find in the data analysis. 



An additional argument in favour of homogeneity at rather small scales has 
been proposed as arising from an appropriate rescaling of the angular corre- 
lations [^]. This is the third evidence that seems to be conflictual with the 
properties observed in the VL correlations analysis. 

We argue here that this apparently contradictory experimental situation can 
be fully understood on the light of the small scale effects in the space distri- 
bution of galaxies. For example a fractal distribution is non analytic in every 
occupied point: it is not possible to define a meaningful average density be- 
cause we are dealing with intrinsic fluctuations that grow with as the scale 
of the system itself. This situation is qualitatively different from an homo- 
geneous picture, in which a well defined density exists, and the fluctuations 
represent only small amplitude perturbations. The nature of the fluctuations 
in these two cases is completely different, and for fractals the fluctuations 
themselves define all the statistical properties of the distribution. This con- 
cept has dramatic consequences in the following discussion as well as in the 
determination of various observable quantities, such as the amplitude of the 
two point angular correlation function. 

It is worth to notice that small scale effects are usually neglected in the study 
of fractal structures because one can generate large enough structures to avoid 
these problems. In Astrophysics the data are instead intrinsically limited and 
we are going to see that an analysis of finite size effects is very important 
We discuss in detail the problems of finite size effects in the determination 
of the asymptotic properties of fractal distributions, considering explicitly the 
problems induced by the lower cut-off (Section 3). 

We discuss in detail the case of real galaxy redshift survey (Sec. 4 and Sec. 5), 
while in Section 6 we consider the case of the magnitude limited samples. In 
particular, we find a criterion which allows us to define the statistical validity 
of the various samples with respect to the finite size effects. In particular it 
results that the apparent exponent a ~ 0.6 at small scales (bright magnitudes 
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m < 19) arises purely from finite size effects and cannot be related to the real 
correlation properties of the sample. 



In Section 7 we discuss the concept of a statistically fair sample, i.e. a sample 
which contains enough statistical information to be representative of the whole 
distribution from which it has been extracted. In particular we define the lower 
number density of points needed to recover the genuine statistical properties 
of a certain distribution (homogeneous or fractal). Moreover we find that the 
all-sky IRAS surveys [1^], do not contain enough points to be statistically 
fair samples. 

The implications of our analysis is that it provides a quantitative criterion 
to define the statistical validity of a redshift and an angular survey, and for 
the optimization of its geometry to obtain the maximum reliable information 
(Sec. 8). These results can be useful also to design future programs for red- 
shift or angular surveys. On this basis we can predict the expected statistical 
properties of several surveys such as CfA2 [Q, Las Campanas [jl2[, ESP ||T5[| , 
etc. 

In Section 9 we discuss the angular correlation functions, on the light of the 
previous analysis. It results that the scaling of the amplitude of the angular 
correlation function with the depth of the sample, that is considered to be a 
possible evidence for of homogeneity , is actually due to finite size effects 



just as the exponent of the number counts. 

In addition in Section 10 we consider the distribution of Radio-galaxies, Quasars 
and 7 ray bursts. We show that these objects are fractally distributed with 
D ^ 1.6^ 1.8. 

Finally in Section 11 we present our main conclusion, stressing in particular 
the dramatic and puzzling consequences on the general theoretical framework 
and, in particular on the Big-Bang model. 



2 Galaxy number counts: Basic Relations 



The basic assumption that we use to compute all the following relations is 
that: 

iy{L,f)=(P{L)D{r) (3) 

i.e. that the number of galaxies for unit luminosity and volume z/(L, r) can 
be separated as the product of the space density D{r) and the luminosity 
function 4>{L). This is a crude approximation in view of the multifractal prop- 
erties of the distribution (correlation between position and luminosity), and a 
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detailed discussion is found in [^] and ||2^. However, for the propose of the 



present discussion the approximation of Eq.y is rather good and the exphcit 
consideration of the multifractal properties would have a minor effect on the 



properties we are going to discuss ( |]49 



The integrated space number density, .i.e. the total number of points inside a 
sphere of radius R, in the general case, has the following property 



N{< R)= J D{r)Sr = J Y,Kr-ri)d^r = BR^ (4) 


where D = 3 for the homogeneous case, while if D < 3 the distribution is 
fractal and B is the prefactor (see Sec. 3). This is an asymptotic relation 
and the problem is to recover this behavior from finite size portions of real 
structures (we discuss in detail this point in Section 3). 

We briefly introduce some basic definitions. If L is the absolute or intrinsic 
luminosity of a galaxy at distance r, this will appears with an apparent flux 

Suppose all the galaxies have the same luminosity L. If the number of points 
inside a sphere of radius r grows as A^(< r) ~ r^, then it is simple to show 
that the number of galaxies with apparent flux greater than / goes as A^(> 
/) ~ ■ This the basic relation for the GNC Now we consider the case in 
which there is a certain distribution of galaxy luminosities. 

For historical reasons the apparent magnitude m of an object with incoming 
flux / is defined to be (0) 



m = —2.5 logj^o / + constant, (6) 

while the absolute magnitude M is instead related to its intrinsic luminosity 
Lby 

M = —2.5 logio L + constant. (7) 

From Eq.^ that the difference between the apparent and the absolute mag- 
nitudes of an object at distance r is (at relatively small distances, neglecting 
relativistic effects) 

m- M = 5 logio r + 25 (8) 

where r is expressed in Megaparsec {IMpc = 3.26 ■ lO^light years.) 

We now compute the expected GNC in the simplest case of a magnitude 
limited (ML) sample. A ML sample is obtained measuring all the galaxies 
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with apparent magnitude brighter than a certain hmit mum- In this case we 
have (for m < mum) 

N{< m) = 5$(oo)10"^"" (9) 
where 

oo 

$(oo) = / (f){M)10~TiM+25)^^ (^Q) 



We consider now the case of a volume hmited (VL) sample. A VL sample 
contains every galaxy in the volume which is more luminous than a certain 
limit, so that in such a sample there is no incompleteness for an observational 
luminosity selection effect (0], Such a sample is defined by a certain 
maximum distance R and an absolute magnitude limit given b: 

Miim = mirrn - ^loQloR " 25 (11) 

{miim is the survey apparent magnitude limit). By performing the calculations 
for the number-magnitude relation, we obtain 

N{<m) = A{m) -WT^ + C{m) (12) 

where A{m) is 

Mlim 

A{m) = B J 0(M)lO-T(^+25)rfM (13) 

A/(m) 

and M{m) is given by Eq.| (with Rvl in the place of r), and it is a function 
of m. The second term is 

M(m) 

C(m) = BR^ J (j){M)dM (14) 

— oo 

This term, as A{m), depends from the VL sample considered. We assume a 
luminosity function with a Schechter shape |^ 



<j){M)dM ~ 10-o-<^+i)^'^e-io°"'"""'rfM (15) 



where 6 1.1 and the cut-off M* ~ -19.5 111. For M(m) > M* we 

have that C{m) ^ 0, and yl(m) is nearly constant with m. This happens 
in particular for the deeper VL samples for which ~ M*. For the less 

deeper VL {Mum > M*) samples these terms can be considered as a deviation 
from a power law behavior only for m — > mum- In Fig.^we show the behavior 
of Eq.|T2| for the case rriiim = 15.5 and we consider VL samples at VL40: 



7 



Fig. 2. Behaviour of A^(< m) for various VL samples of an ideal survey with 
^lim = 15.5. For the deeper VL sample {120h~^ Mpc) (large squares) a straight 
power law is found, while for the AOh~^ Mpc (small squares) sample there is a 
flattening for m — > mum, even if the true exponent a = D/5 is found at lower 
magnitudes. The sample at 80h~^ Mpc (crosses) show an intermediate situation. 

{R = 40h-^Mpc,M* = -17.53); VL80 {R = SOh-^Mpc,M* = -19.07); 
VL120: {R = UOh^^Mpc, M* = -19.93). We can see that the first subsample 
for m rriiim show a curved behavior while in the other cases the Iog(A^(< 
m))—m relation is well fitted by a power law with the right exponent a = D/5. 
This is the ideal case where no finite size effects are present. 

If one has 4>{M) = S{M — Mq) then it is simple to show that log(A^(< m)) ~ 
{D/5)m also each VL sample. This relation will be useful in the following 
(Section 5). 



3 The problem of finite size effects 



In this section we discuss the general problem of the minimal sample size that 
is able to provide us with a statistically meaningful information. For example 
the mass-length relation for a fractal, that defines the fractal dimension is 
and m) 



g = lim V>' (16) 

r^oo log(r) 

However this relation is properly defined only in the asymptotic limit, because 
only in this limit the fluctuations of the fractal structure are self-averaging. 
A fractal distribution is characterized by large fluctuations at all scales and 
these fluctuations determine all the statistical properties of the structure. If 
the structure has a lower cut-off, as it is the case for any real fractal, one 
needs a "very large sample" in order to recover the statistical properties of 
the distribution itself. Indeed, in any real physical problem we would like to 
recover the asymptotic properties from the knowledge of a finite portion of a 
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Fig. 3. Behavior of the density computed from one point. At small distances, inside 
the Voronoi's length V, one finds almost no galaxies because the total number is 
rather small. Then the number of galaxies starts to grow, but this regime is strongly 
affected by finite size fluctuations. Finally the correct scaling region r ~ A is reached. 
In the intermediate region the density can be approximated roughly by a constant 
value. This leads to an apparent exponent D ~ 3 (in terms of GNC a ~ 0.6). This 
exponent is not real but just due to the size effects. 



Fig. 4. Definition of the Voronoi volume (see text) 

fractal and the problem is that a single finite realization of a random fractal 
is affected by finite size fluctuations due to the lower cut-off. 

Considering homogeneous distribution we can define, an average, a charac- 
teristics volume associated to each particle. This is the Voronoi volume 



Vv whose radius is of the order of the mean particle separation. It is clear 
that the statistical properties of the system can be defined only in volumes 
much larger than v^. Up to this volume in fact we observe essentially nothing. 
Then one begins to include a few (strongly fluctuating) points, and finally, the 
correct scaling behavior is recovered (Fig.^. For a Poisson sample consisting 
of N particles inside a volume V then the Voronoi volume (Fig.^) is of the 
order 

v.-^ (17) 
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and £^ ~ ^y^. In the case of homogeneous distribution, where the fluctuations 
have small amplitude with respect to the average density, one readily recovers 
the statistical properties of the system at small distances, say, r > 5iv 

The case of fractal distribution is more subtle. For a self-similar distribution 
one has, within a certain radius tq, Nq objects. Following [0] we can write the 
mass-length relation between A^(< R), the number of points inside a sphere 
of radius R, and the distance R of the type 

N{< R) = BR^ (18) 

where the prefactor B is related to the lower cut-offs Nq and tq 

B = ^ (19) 

In this case the prefactor B is defined for spherical samples. If we have a 
portion of a sphere characterized by a solid angle Q, we write Eq.|l8| as 

N{< R) = BR^^ (20) 

4:71 

Suppose we have only a finite portion of a fractal structure characterized by 
a volume 

nR^ 

V{R) = ^ (21) 
In this case the Voronoi's volume can be written as 

V, = ^— (22) 
We define 

Vn = (23) 

as the minimal volume which allows us to recover the statistical properties 
of the system. From a series of numerical tests on artificial distributions we 
will conclude that Vq ~ lO^Vy. From this we can define the minimal statistical 
length as 

The minimal statistical length A is an explicit function of the prefactor B and 
of the depth of the survey R. Essentially this discussion implies that A should 
be about one order of magnitude larger than ~ vl^^, apart from solid angle 
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effects. We note tfiat in order to optimize tlie region of tlie sample beyond tfie 
minimal statistical length, we liave to maximize tlie ratio 



-R „ D „ 1 , . 

- ~ R-n3 (25) 
A 

hence if D > 1 it is clearly convenient to increase the depth of the sample 
rather than the solid angle in order to improve the statistics as we discuss in 
detail in the following (see in particular Sec. 7). 

In the case of real galaxy catalogs we have to consider the luminosity selection 
effects. In a volume limited (VL) sample, characterized by an absolute magni- 
tude limit Miimi (minimal absolute flux) the mass-length relation Eq.^, can 
be generalized as (see Sec. 2) 

N{R, MuJ = BR^^^{Mum) (26) 

where ip^Mum) is the probabihty that a galaxy has an absolute magnitude 
brighter than Mum 

(l){M)dM , , 

W(ooj 

where (f){M) is the Schechter luminosity function (Sec. 2) and \l/(oo) is the 
normalizing factor 

^(oo) = / (t){M)dM (28) 



where M^in is the fainter absolute magnitude surveyed in the catalog (usually 
M™„ ^ -10 --11). 

It is possible to compute the intrinsic prefactor B from the knowledge of the 
average conditional density r(r) 0, |21[] as 



r(r) = —Br^-^ (29) 
An 

computed in the VL samples and normalized for the luminosity factor (Eq.pTl). 
In the various VL subsamples of Perseus-Pisces and CfAl redshift surveys we 
find that 

B^l{)^ 15{h-^Mpc)-^ (30) 
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Fig. 5. The spatial density n(r) and A^(< r) computed in two VL sample cut at 
60h~^ Mpc (a, 6), and 70h~^ Mpc {c,d). The density is dominated by large fluctua- 
tions and it has not reached the scaling regime. 



and for R in the range 100 -r- 1000/i ^Mpc we find that a good approximation 
to Eqj31| is 



This is the value of the minimal statistical length that we will use in the fol- 
lowing (see Table 1). This length depends also, but weakly, from the particular 
morphological features of the sample. In the case of Perseus-Pisces {Q = 0.9) 
we find A ~ 50h~^Mpc as shown in Fig.^ and Fig.^ and for CfAl {Q = 1.8) 
A ~ 30h~^Mpc (Fig.0): in both cases the agreement with Eq.|3^ is quite good. 
Moreover we have done the following test: we have cut the Perseus-Pisces sur- 
vey at various solid angles and we have checked that Eq.^ holds with good 
accuracy for various values of R and Q. 

For relatively small volumes it is possible to recover the correct scaling be- 
havior for scales of order of £^ (instead of ~ lOi^) by averaging over several 
samples or, as it happens in real cases, over several points of the same sample 
when this is possible. Indeed when we compute the correlation function we 
perform an average over all the points of the system even if the VL sample 
is not deep enough to satisfy the condition expressed by Eq.|32|. Even in this 
case the lower cut-off introduces a limit in the sample statistics as we point 
out in Sec. 7 and Sec. 8. 

In Section 4 we show how these considerations can be applied to the case of a 
real redshift survey (Perseus-Pisces) as well as in the case of artificial catalogs 
with a priori assigned properties. 



depending on the parameters of the Schechter function M* and 6. From Eq.^ 
and Eq.^ we obtain 




(31) 



A 



(40 ^ 60)h-^Mpc 



(32) 
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Table 1 



The minimal statistical length A for several redshift surveys 



Survey 


n{sr) 


X{h-^Mpc) 


References 


CfAl 


1.8 


30 


Huchra et al, (1983); Davis et al, (1982) 


CfA2 


1.8 


30 


Geller & Huchra, (1988); Park et al, (1994) 


SSRSl 


1.13 


40 


Da Costa et al, (1988; 1991) 


SSRS2 


1.13 


40 


Da Costa et al, (1994) 


PP 


1 


50 


Haynes et al, (1988); Sylos Labini et al, (1995) 


LEDA 


4 TT 


20 


Paturel et al, (1988) 


ORS 


8 


25 


Santiago et al, (1994) 


APM 


1.36 


50 


Loveday et al, (1992a, b) 


LCRS 


0.12 


100 


Schectman, (1992) 


KOSS 


0.01 


250 


Kirshner et al, (1981) 


ESP 


0.006 


300 


Vettolani et al, (1994) 


KOO 


1.5 • 10~* 


950 


Koo, (1988) 
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Fig. 6. The spatial density n(r) and A^(< r) computed in two VL sample VLllO 
(a, 6) and VL130 {c,d). In this case the density is dominated by large fluctuations 
only at small distances, while at larger distances, after the Perseus Pisces chain at 
a very well defined power law behavior is shown, with the same exponent 
of the correlation function of Fig. 8 {D = 2) 



Fig. 7. The spatial density (a) n(r) and the integral number of points (6) A^(< r) 
computed in a VL sample (VL60) of CfAl. A very well defined power law behavior 
with D ~ 2 is shown for r > 30h^^ Mpc ~ A. 

4 Integral from the vertex in Perseus-Pisces 



To clarify the effects of the spatial inhomogeneities and finite size effects we 
have studied the GNC in the Perseus-Pisces redshift survey 0. In a previous 
paper (0]) we have analyzed the spatial properties of galaxy distribution in 
this sample and we briefly summarize our main results. We find that the cor- 
relation function r(r) (conditional average density - 0), computed in several 
VL samples, is 

r(r) = <^(^"^oH^-+^-^o)> ^ ,-7 (33) 
<n> 

where 7 ~ 1 up to ~ 30h~^Mpc. This result means that the galaxy distribution 
in this sample is fractal up to this depth with dimension D = 3 — 7 ~ 2. This 
trend of larger value of D with respect to CfAl {D ^ 1.5) is confirmed by 
all the deeper samples (CfA2 |5§ 0, LEDA ^ [g, ESP 0) and it is 
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probably due to a more stable statistics (see Sec. 7) of this larger sample with 
respect to CfAl. We have also studied the behavior of the galaxy number 
density in the VL samples, i.e. the behavior of: 

n{r) = ~ r^-' (34) 

V{r) 

One expects that, if the distribution is homogeneous the density is constant, 
while if it is fractal it decays with a power low behavior. 

When one computes the correlation function of Eq.|33|, one indeed performs an 
average over all the points of the survey. In particular we limit our analysis 
to the size defined by the radius of the maximum sphere fully contained in 
the sample volume, so that we do not introduce any weighting scheme in the 



treatment of the boundaries of the sample |2l[]. Several authors (e.g. |pO| ) 
have extended the effective depth for the computation of r(r) making use of 
the weighting schemes for the treatment of the boundary conditions: Guzzo et 



a/. [^ found a tendency towards homogenization at ~ 40h ^Mpc in PP, while 



we have limited our analysis at Rg ~ 30h~^Mpc As we show in the following, 
such a method introduces a spurious tendency towards homogenization, and 
must be avoided for system characterized by long-range correlation (see also 



and |2TJ). 



On the contrary Eq.^ is computed only from a single point, the origin. This 
allows us to extend the study of the spatial distribution up to very deep scales 
and in fact we find that Eq.|3^ holds up to ~ 130h~^Mpc, that is the maximum 
depth surveyed by this catalog, with the same exponent 7 fa 1 as before. The 
price to pay is that this method is strongly affected by statistical fluctuations 
and finite size effects. Analogously, when one computes A^(< m), one does not 
perform an average but just counts the points from the origin. As in the case 
of n(r) also N{< m) will be strongly affected by statistical fluctuations due 
to finite size effects. We now clarify how the behaviour of N[< m), and in 
particular its exponent, are influenced by these effects. 

We have computed the n(r) in the various VL sample, and we show the results 
in Fig.|^ and Fig.||. In the less deeper VL samples (VL60, VL70) (Fig.||) we 
see that the density does not show any smooth behavior because in this case 
the finite size effects dominate the behaviour as we are at distances r < A 
(Eq.^2]), while at about the same scales we can find a very well defined power 
law behavior with the correlation function analysis (Fig.^. 

In the deeper VL samples (VLllO, VL130) we can see (Fig.^ that a smooth 
behavior is reached for distances larger than the scaling distance r «i A ~ 
50h~^Mpc. The fractal dimension turns out to be D ~ 2 as in the case of the 
correlation function (Fig.||). Also the amplitude of the density matches quite 
well that of r(r) (if properly normalized). 
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Fig. 8. The correlation function for VL60 (small squares) and VL70 crosses) and 
VLllO (large squares). In this case the correlation function r(r), that is the con- 
ditional density averaged from each point of the sample, shows a very well defined 
power law behaviour. The finite size effects are eliminated exactly for the averaging 
procedure. The reference line has a slope of —7 = D — 3 = —1 



Fig. 9. The Number counts A^(< m) for the VL samples VL60 and VL70. The slope 
is a ~ 0.6. In this case occurs a flattening for m — > m/j^. 

We show in Fig.|^, Fig.|TD| and Fig.|TT| the behaviour of A^(< m) respectively 
for the VL samples of Fig.| (VL60 and VL70), of Fig.| (VLllO, VL130) and 
finally for the whole magnitude limit sample. We can see that for VL60 and 
VL70 there are very strong inhomogeneities in the behaviour of n{r) and 
these are associated with a slope a ~ 0.6 for the GNC (The flattening for 
m miim is just to a luminosity selection effect that is explained in Section 2). 
For VLllO and VL130 the behaviour of the density is much more regular and 
smooth, so that it shows indeed a clear power law behavior. Correspondingly 
the behaviour of A^(< m) is well fitted by a ~ 0.4. Finally the whole magnitude 
limit sample is again described by an exponent a ~ 0.6. 

We have now enough elements to describe the behaviour of the GNC. The 
first point is that the exponent of the GNC is strongly related to the space 
distribution. Indeed what has never been taken into account before is the role 
of finite size effects. The behaviour of the GNC is due to a convolution of 
the space density and the luminosity function (Sec. 2), and the space density 
enters in the GNC as an integrated quantity. The problem is to consider the 
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Fig. 10. The Number counts A^(< m) for the VL samples VLllO and VL130. The 
slope is a ~ 0.4, a part from the initial fast growth. This behavior correspond to a 
well defined define power law behavior of the density with exponent ~ 5a ~ 2 
(Fig.6). 



Fig. 11. The Number counts A^(< m) for the whole magnitude limit sample. The 
slope is a « 0.6 and it is clearly associated only to fluctuations in the spatial 
distribution rather than to a real homogeneity in space. 

right space density in the data analysis. In fact, if we have a very fluctuating 
behaviour for the density in a certain region, as in the case shown in Fig.|^, 
its integral over this range of length scales is almost equivalent to a flat one. 
This can be seen also in Fig.^: at small distances one finds almost no galaxies 
because the total number is rather small. Then the number of galaxies starts 
to grow, but this regime is strongly affected by finite size fluctuations. Finally 
the correct scaling region r «i A is reached. This means for example that if 
one has a fractal distribution, there will be first a raise of the density, due to 
finite size effects and characterized by strong fluctuations, because no galaxies 
are present before a certain characteristic scale. Once one enters in the correct 
scaling regime for a fractal the density becomes to decay as a power law. So 
in this regime of raise and fall with strong fluctuations there will be a region 
in which the density can be approximated roughly by a constant value. This 
leads to an apparent exponent D ~ 3, so that the integrated number grows as 
N{< r) ~ and it is associated in terms of GNC, to a ~ 0.6. This exponent 
is therefore not real but just due to the finite size fluctuations. Only when a 
well defined statistical scaling regime has been reached, i.e. for r > A, one can 
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Fig. 12. (a) n(r); (6) A^(< r) and (c) A^(< m) (single point (squares), averaged 
(crosses)) for an homogeneous distribution. The Voronoi length is ~ 2Mpc. In this 
case the finite size effects do not affect too much the properties of the system because 
an homogeneous distribution is characterized by small amplitude fluctuations. This 
situation is qualitatively different from the fractal one, and for distance r > (2 — 3)iy 
the correct scaling regime is readily found. 

find the genuine scaling properties of the structure, otherwise the behaviour 
is completely dominated by spurious finite size effects. 

The question of the difference between the integration from the origin and 
correlation properties averaged over all points lead us to a subtle problem of 
asymmetric fluctuations in a fractal structure. From our discussion, exempli- 
fied by Fig.^ the region between and A corresponds to an underdensity 
with respect to the real one. However we have also showed that for the full 
correlation averaged over all the points (r(r)) the correct scaling is recovered 
at distances appreciably smaller than A. This means that in some points one 
should observe an overdensity between iy and A. However, given the intrinsic 
asymmetry between filled and empty regions in a fractal, only very few points 
will show the overdensity ( a fractal structure is asymptotically dominated by 
voids). These few points nevertheless will have an important effect on the av- 
erage values of the correlations. This means that, in practice, a typical points 
shows an underdensity up to A as shown in Fig.|^. The full averages instead 
converge at much shorter distances. This discussion shows the peculiar and 
asymmetric nature of finite size fluctuations in fractals as compared to the 
symmetric Poissonian case. 

For homogeneous distribution (Fig.|l2D the situation is in fact quite different. 
Below the Voronoi length iy there are finite size fluctuations, but for distances 
r > (2 -h 4:)iy the correct scaling regime is readily found for the density, the 
integrated density and the number counts. In this case the finite size effects do 
not affect too much the properties of the system because a Poisson distribution 
is characterized by small amplitude fluctuations. 

In the VL samples where n(r) scales with the asymptotic properties (Fig.^) 
the GNC grows also with the right exponent (a = D/5). If we now consider 
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instead the behaviour of the GNC in the whole magnitude hmit survey, we fund 
that the exponent is a ~ 0.6 (Fig.|rT|). This behavior can be understood by 
considering that at small distances, well inside the distance A defined by Eq.|3^, 
the number of galaxies present in the sample is large because there are galaxies 
of all magnitudes. Hence the majority of galaxies correspond to small distances 
(r < A) and the distribution has not reached the scaling regime in which the 
statistical self-averaging properties of the system are present. For this reason in 
the ML sample the finite size fluctuations dominate completely the behavior 
of the GNC. Therefore this behaviour in the ML sample is associated with 
spurious finite size effects rather than to real homogeneity. We discuss in a 
more quantitative way the behavior in ML surveys in Sec. 6, while in Sec. 5 
we perform a test to proof that the exponent a 0.4 is the real statistical 
property of the galaxy distribution. 



5 A test for the finite size effects: average N{< m) 



To prove that the behaviour found in Fig.^, and then the exponent a ~ 
0.6, is connected to large fluctuations due to finite size effects in the space 
distribution and not for real homogeneity, we have done the following test. We 
have adopted the same procedure used for the computation of the correlation 
function (0, [^]), i.e. we make an average for A^(< m) from all the points of 



the sample rather than counting it from the origin only. 

To this aim we have considered a VL sample with galaxies and we have built 
independent flux-limited surveys in the following way. We consider each 
galaxy in the sample as the observer, and for each observer we have computed 
the apparent magnitude of all the other galaxies. To avoid any selection effect 
we consider only the galaxies that lie inside a well defined volume around the 
observer. This volume is defined by the maximum sphere fully contained in 
the sample volume with the observer as a center. 

Moreover we have another selection effect due to the fact that our VL sample 
has been built from a ML survey done with respect to the origin. To avoid 
this incompleteness we have assigned to each galaxy a constant magnitude M. 
In fact, our aim is to show that the inhomogeneity in the space distribution 
plays the fundamental role that determines the shape of the A^(< m) relation, 
and the functional form of the luminosity function enters in Eq.|l^ only as an 
overall normalizing factor. 

Once we have computed Ni{< m) from all the points i = 1,..,N we then 
compute the average. We show in Fig.|T3| and Fig]T^ the results for VL60 and 
VLllO: a very well defined exponent a = D/5 ~ 0.4 is found in both cases. 
This is in fully agreement with the average space density (the conditional 
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Fig. 13. The average A^(< m) in the VL sample VL60. The squares crosses refer 
to the average A^(< m) computed assigning to all the galaxies the same absolute 
magnitude Mq = M* . The reference line has a slope a = 0.4 



Fig. 14. The average A^(< m) in the VL sample VLllO. The squares crosses refer 
to the average A^(< m) computed assigning to all the galaxies the same absolute 
magnitude Mq = M* . The reference line has a slope a = 0.4 



average density r(r)) that shows D ^ 2 in these VL samples. 

We have also performed various tests on artificial distributions with a priori 
assigned properties. Using the random /3— model algorithm |^ and we have 
performed the analysis by assigning to each point of the system the same 
absolute magnitude. The results are in complete agreement with the previous 
findings: if we do not perform any average the exponent of the GNC is strongly 
affected by the presence of fluctuations due to finite size effects and we obtain 
a ~ 0.6, while if we compute the GNC by making an average over all the points 
of the structure we find again that the relation a = D/5 holds in a very well 
approximation (Fig.|l^ and Fig.|l6D. On the contrary in the homogeneous case 
(Fig.^) one does need to perform any average to recover the correct scaling 
properties of N{< m), because the system reaches very soon (r > i^) the 
correct scaling properties. 
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Fig. 15. (a) The space density computed from the vertex in an artificial fractal 
sample with D = 2.5. (6) The A^(< m) relation for the same sample The reference 
line has a slope a = 0.6. 



Fig. 16. (a) The average space density computed in the artificial fractal sample 
with D = 2.5 of Fig. 15. The reference line has a slope D — 3 = —1. (6) The average 
A'^(< m) for the same sample. The reference line has a slope a = D/f> = 0.5. 

6 GNC in Magnitude limited catalogs 



We are now able to clarify the problem of ML catalogs. Suppose to have 
a certain survey characterized by a solid angle Vt and we ask the following 
question: up to which apparent magnitude limit we have to push our 

observations to obtain that the majority of the galaxies lie in the statistically 
significant region (r > A) defined by Eq.^ Beyond this value of mum we 
should recover the genuine properties of the sample because, as we have enough 
statistics, the finite size effects self-average. From the previous condition for 
eachil we can find a solid angle mum so that finally we are able to obtain 
miim = miimi^) in the following way. 

We assume a Schecther luminosity function (Eq.|T^) and that the fractal di- 
mension is D = 2 (we have tested that the final result depends very weakly 
on some reasonable values for the three parameters used: D for the space 



distribution and S and M* for the luminosity function). As shown in Fig. 17 



the requested condition happens when the area I2 > h- We have evaluated 
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Fig. 17. An ideal survey in the d — M space. If the majority of galaxies lie in the sta- 
tistically significant region then we can obtain a meaningful statistical information 
from the survey. This condition implies that the integral I2 is greater than Ii. 



Fig. 18. If a survey defined by the apparent magnitude limit ran^ and the solid angle 
O lie in the statistically significant region it is possible to obtain the self-averaging 
properties of the distribution also with the integral from the vertex. Otherwise one 
needs a redshift survey that contains the three dimensional information, and then 
one can perform average. Only in this way it is possible to smooth out the finite 
size effects. 



numerically these integrals and in Fig.lT8| we show the result that satisfies the 



requested conditions. From the previous figure it follows that for m > 19 the 
statistically significant region is reached for almost any reasonable value of the 
survey solid angle. This implies that in the deep surveys, if we have enough 
statistics, we readily find the right behavior (a = -D/5) while it does not 
happens in a self-averaging way for the nearby samples. Hence the exponent 
a ~ 0.4 found in the deep surveys (m > 19) is a genuine feature of galaxy dis- 
tribution, and corresponds to real correlation properties. In the nearby surveys 
m < 17 we do not find the scaling region in the ML sample for almost any 
value of the solid angle. Correspondingly the value of the exponent is subject 
to the finite size effects, and to recover the real statistical properties of the 
distribution one has to perform an average. 

From the previous discussion it appears now clear why a change of slope is 
found at m ~ 19: this is just a reflection of the lower cut-off of the fractal 
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Photometric Band Driver Metcalfe Tyson Lilly Djorgovski Jones 



U ..... 0.5 

B 0.43 0.49 0.45 0.38 - 

V 0.39 - ... 

R 0.37 0.37 0.39 

I 0.34 - 0.34 0.32 - 

K .... 0.32 



Table 2 

The exponents of galaxy counts in different spectral bands 

structure and in the surveys with mum > 19 the self-averaging properties 
of the distribution cancel out the finite size effects. This result depend very 
weakly on the fractal dimension D and on the parameters of the luminosity 
function 5 and M* used. Our conclusion is therefore that the exponent a ~ 0.4 
for m > 19 is a genuine feature of the galaxy distribution and it is related to 
a fractal dimension D ^ 2, that is found for m < 19 in redshift surveys only 
performing averages. 



We note that this result is based on the assumption that the Schechter lumi- 
nosity function (eq.(15)) holds also at high redshift, or, at least to m ~ 20. 



This result is confirmed by the analysis of Vettolani et al. [|15[ who found that 
the luminosity function up to 2; ~ 0.2 is in excellent agreement with that found 



in local surveys [14 



Finally we report in Table 2 the exponents of the galaxy counts in different 
frequency bands, at faint magnitudes. We can see that the exponents is lower 
than 0.6 in all the case, and it is in the range 0.3^0.5, so that D is in the range 
1.5-i-2.5. These differences can be probably related to the multifractal behavior 
of the luminous matter distribution ||4^ or to a poor statistics. Only a three 



dimensional analysis allows one to decide between these two possibilities. 
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7 Definition of a statistically fair sample 



In the previous sections we tiave defined tlie condition to select a sample large 
enough to manifest the self-averaging properties of a fractal distribution. We 
now study in detail a different but related question. Given a sample with a well 
defined volume, which is the minimum number of points that it should contain 
in order to have a statistically fair sample even if one computes averages over 
all the points. 

Consider a sample which contains a portion of a fractal structure with a lower 
cut-off B. Given the geometry of the sample (depth R and solid angle Q) 
we obtain the value A of the minimal statistical length according to Eq.p^ 
and Eq.|3l]. Now we investigate what happens if we eliminate randomly more 
and more points from the sample. The fractal dimension and the lower cut- 
off {minimal statistical length) will not be affected by this depletion process, 
because they are related only to the intrinsic properties of the fractal structure, 
i.e to B. Instead, the correlation properties are more and more affected by a 
statistical noise as we cut the points that contribute to the statistics. This 
noise is superimposed to the genuine signal so that D and A are not changed 
at all, but the estimation of their values becomes noisy. Obviously, given a 
finite portion of the original system characterized by a lower cut-off B, it will 
exist a maximum value of the number of points that we can eliminate from the 
structure in order to conserve the genuine statistical properties of the original 
distribution. 

At this point we can characterize the statistical information in each VL sample 
more quantitatively. Suppose that the sample volume is a portion of a sphere 
with solid angle Q and radius Rvl, and that the number of points inside this 
volume, NvL, is 

NvL = BvL^R^L- (35) 

where By^ takes into account the luminosity selection effect. The original 
system inside the same volume contains 

N = B^R^^. (36) 
Hence the percentage of galaxies present in the sample can be written as 

In this way we can associate to each VL sample a well defined value of p (see 
Table 3). 
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Table 3 

The percentage of galaxies in some volume limited sample of several surveys. 

RvL is the depth of the volume limited sample and Nvl is the number of points 
contained. The all-sky IRAS catalogs have a very poor statistics (see text) 
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226 
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PP 


1 
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40 
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1.1 % 


IRAS (2Jy) 
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80 
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0.2% 


IRAS (1.2Jy) 


Att 


60 


876 


1.6 Vo 


IRAS (1.2Jy) 


Att 


80 


766 


0.8% 


IRAS (1.2Jy) 


Att 


100 


704 


0.5 %o 



The crucial point is that the random cut of point must stop at a certain limit: 
beyond this limit one does not have in the sample enough points to recover 
the real statistical properties of the distribution. We call statistical fair sample 
a sample that contains a number of points for unit volume larger than this 
limit. The problem is how to define this limit, or, in other words, to determine 
the minimal value of the percentage of Eq.^ that allows one to recover the 
genuine information, for example, by the two points correlation analysis. We 
can proceed in two independent way. The first is by analyzing the correlation 
properties of the VL samples of Perseus-Pisces and CfAl, the second is by 
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Fig. 19. (a) The average conditional density T(r) for VLIOO of Perseus-Pisces and 
the whole ML survey. The reference line has a slope of —7 = —1.1. The percentage 
of galaxies present in the sample is ~ 6%. (b) The same oi (a) but the percentage 
of galaxies present in the sample is ~ 1.2%. This is the limiting case to recover 
the statistical properties of the sample, according to the condition of Eq.37 (c) The 
same of (a), (b) but the percentage of galaxies present in the sample is ~ 0.5%. In 
this case we are well below of the condition of Eq.37. We can see that at small scale 
there is a residual power law behavior, while at large scale the power law behaviour 
is broken. 

studying artificial distributions. 

In the VL limited samples of the Perseus-Pisces survey we can eliminate ran- 
domly points up to reach the fraction contained in the various IRAS samples. 
We note that the correlation function has a clear power law behaviour up to 
30h~^Mpc if the percentage remain larger than 

p>1^2% (38) 

then it shows a cut off towards homogenization (Fig.[T9|). This is clearly spuri- 
ous and due to the fact that we have reached the limit of statistical validity or 
fairness of the sample. The apparent homogeneous behaviour is related to the 
fact that in a given sample, the mean separation among galaxies grows when 
the number of points decreases (A^, ~ (V/NY^^), and when it becomes of the 
same order of the largest void present in the sample volume, the correlation 
properties are destroyed. This means that the artificial noise introduced by 
the depletion of points, have erased the intrinsical fluctuations of the original 
system. In this situation see the system as an homogeneous one. 

We have considered also an artificial catalog with a priori assigned properties 



generated with the random /3— model algorithm [Q. We show the results in 



Fig.^ also in this case we can recover the right statistical properties only in 
the limit of Eq.p8|. Clearly Eq.|38| depends from the morphological features of 
the realization of the fractal structure and the percentage can weakly fluctuate 
from a realization to another 

It is possible to compute the Bs in the VL samples of several surveys and 
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Fig. 20. (a) The average conditional density r(r) for an artificial fractal sample with 
dimension D = 2. The reference line has a slope of —7 = —1. The percentage of 
galaxies present in the sample is ^ 100% (diamonds), ~ 5% (crosses) (this is the 
limiting case to recover the statistical properties of the sample, according to the 
condition of Eq.37) ~ 1% (squares) In this case we are well below of the condition 
of Eq.37. We can see that at small scale there is a residual power law behavior, 
while at large scale the power law behaviour is broken. 



Fig. 21. The average conditional density for some VL samples of the IRAS 1.2 Jy 
redshift survey. The reference line has a slope of -D ~ 2 (7 = 1). The crossover 
towards homogenization is spurious and due to a poor sampling (see Table 3). 

we find that in CfAl, PP, ESP and LEDA Eq.|3^ is well satisfied in almost 
all the VL samples (see Table II). On the contrary the VL samples extracted 
from the IRAS 2Jy survey ^ and the IRAS 1.2Jy ^ survey are well 
beyond the limit of Eq.^ (Table II): this is the reason why and find 



that in these catalogs tq does not scale with sample depth as it should be 
for the fractal case. In fact, in these cases one observes a constant density, 
as the power law correlation have been destroyed by a very poor sampling 
beyond a certain scale (~ 15h~^Mpc) (Fig.pTD. On the other hand Strauss et 
al, stress that IRAS galaxies belong to the same structures as the optical 
ones, and in particular, they point out that the infrared galaxies do not fill 
the voids that define the same highly irregular patterns seen in the optical 
samples. However and |^ conclude that the IRAS galaxies seem to be 



less correlated than the optical ones, as the value of the so-called correlation 
length To is smaller than that of CfAl. Moreover they claim that the sample 
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is homogeneous and that their analysis disprove that galaxy distribution is 
fractal, at least for the infrared galaxies. 

On the contrary our results imply that the correlation properties as the IRAS 
galaxies are the same of the optical ones, even if in the infrared catalogs it is 
not possible to recover the correct statistical features because of a very poor 
statistics (Table II). 



8 Experimental implications 



In Sect. 5 we have defined the minimal statistical length and in Sec. 7 we have 
defined what is a statistically fair sample. We consider now the implications 
of these considerations for the optimization of real redshift surveys. In fact 
we are able to establish a quantitative criterion that defines the statistical 
validity of a survey, so that it is possible to optimize its geometry in order to 
obtain the maximum reliable information. Eq.^ defines the scaling region of 
a certain survey with solid angle Q and Eq.^ Eq.^ give the conditions for 
its statistical validity. 

Considering a ML sample, the survey becomes statistically meaningful, if its 
depth, defined by the apparent magnitude limit, and its solid angle satisfy 
the condition that the majority of galaxies lie in the scaling region. In Fig.|T8| 
we have identified the sample fairness condition for angular surveys. Only the 
deep surveys satisfy this condition, while the nearby samples are all affected by 
finite size effects. This means that one cannot detect statistically meaningful 
properties from such surveys. Only in the case in which one also measure 
the redshifts, as for CfAl for example, one can barely recover the genuine 
properties of the (3d) distribution as the minimal statistical length X is of the 
order of the depth of the catalog. In the case of Perseus- Pisces and CfA2 one 
can find some VL samples, in which the effective depth Rg is appreciably larger 
than A. In these cases it is possible to study the integral from the vertex (i.e. 
A^(< r) or n(r)) and this allows us to extend the analysis up to the deeper 
depth observed {Rg ~ ISOMpc). But for these same catalogs the ML properties 
are strongly affected by finite size effects because most galaxies are to close. 

We report in Table 1 the minimal statistical lengths for several redshift surveys 
05 10' [11 "IHI' H'lHl' some of which are published, while others are in 



progress, and this provides a precise prediction that can be tested in these 
surveys. We have tested such a condition in Perseus-Pisces, CfAl, LEDA and 
ESP and the agreement is very good. 

Related to the definition of a redshift catalog we consider the following prob- 
lem: given the solid angle of the survey (and hence the minimal statistical 
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length A) the crucial point is that the sample should extend enough beyond A, 



having a significant statistics in agreement with Eq.|38|. In order to optimize 
a redshift survey we can define a quantitative criterion to select the optimum 
geometry so that the finite size effects are minimized. A redshift survey is char- 
acterized by two parameters: the solid angle Q and the apparent magnitude 
hmit miim. 

In order to study the statistical properties of the galaxy space distribution 
we have to select from the whole ML survey a certain Volume Limited (VL) 
subsample. The number density of galaxies [uvl) in such a sample is given by 
Eq.|26| and Eq.p7|. In particular this depends on the absolute magnitude limit 
of such a sample which is defined by Eq.^, and is given by (Sec. 2) 

Miim = miim - 5 logio RvL - 25. (39) 

Hence uvl directly depends on mum, fl and Rvl, where Ryi is the maximum 
depth of the VL sample. In particular, the first condition is that uvl, which 
refers to the deepest VL sample obtainable from the ML survey, is such that 

^>(2-3)% (40) 

according to Eq.^ so that the sample is statistically fair. The second condition 
is that 

RvlZ{2^A)X (41) 

In this way the VL extends more than the minimal statistical length so that 
the statistical properties can be recovered. These conditions can be enough 
to determine the best values of (f2,m/jm) but, in order to limit the number of 
redshifts measured we can impose also that the total number of galaxies in 
the whole Magnitude Limited (ML) catalog is at least of the order of = 
(2-=-3)10^ (this number is a reasonable choose considering the real experimental 
surveys). Hence from Eq.^ and Eq.|l^ we have [third condition) 

N{< mum) = 5^$(oo)10T-"'" > M (42) 
47r 

It is possible to find numerically the best solution that satisfies the three 
previous conditions numerically. In Figj2^ we show our results in the {mum, ^) 
space: for a given mum the straight line represents the best solid angle Q 
that maximizes the three conditions. The solid angle Q decreases with the 
increasing magnitude as 



In Fig. 23 we show the correspondent depth of the deepest VL sample; Rvl 
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Fig. 22. The optimum match between the sohd angle O and the apparent magnitude 
Umit rriiim, of a redshift survey that satisfies the three conditions explained in the 
text. For any of these surveys the number of galaxies in the ML catalog is constant 
(A^ ~ (3 -7- 5)10^) as well as it is constant the number of galaxies the deepest VL 
sample. The depth of such VL sample is the order ~ (2 4) A. 



Fig. 23. The depth of the deepest VL sample as a function of the apparent magnitude 
(see Fig. 17) 



grows as 

RvL ~ lO"-^"^"™ (44) 

We can use these results in the following way: suppose that our aim is to study 
the space distribution of galaxies up to, say ~ 800h~^Mpc. From Fig.^ we 
can find the mum of the redshift survey corresponding to such a distance and 



from Fig. 22 the relative solid angle Q. If we follow these conditions we will 



obtain the best statistical information from the minimum number of redshifts 



(~ 3000). From Fig.|2^ we can note that the ESP survey {mum = 19.4 and 
fl = 0.006 |T5[) is almost optimal in this respect, and it can be used to study 
the large scale space distribution from 300h~^Mpc up to ~ (800-i-900)/i~^Mj9c 
and 
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9 The angular two-point correlation function 



Dogterom & Pietronero [§2| (see also 0) studied in detail the surprising and 
subtle properties of the angular projection of a fractal distribution. They find 
that the angular projection produces an artificial crossover towards homoge- 
nization with respect to the angular density. This crossover is artificial (just 
due to the projection) as it does not correspond to any physical features of the 
three dimensional distribution. Moreover they showed that there is an explicit 
dependence of the angular two point correlation function uj{6) on 6^ the sam- 
ple angle: this effect has never been taken into account in the discussion of 
real angular catalogs. These arguments show that it is very dangerous to make 
any definite conclusion just from the knowledge of the angular distribution. 
However, there is a point of the discussion that remains still open. If fact, 
some authors (|2^, claim that one of the most important facts that dis- 
proves the existence of fractal correlations at large scales, is the scaling of the 
amplitude of the two point angular correlation function (ACF) with sample 
depth, in the small angles approximation. We can now clarify this puzzling 
situation. The ACF is defined as 



< n >2 

Assuming that the fractal correlation are only present at small scales, i.e. that 
^(r) = {ro/ry, it is possible to show that in the small angle approximation 



{9 << 1) one has for the homogeneous case that 

u{e) ~ 0'-\ro/D,y (46) 
where the depth is 

is the cut-off of the Schechter luminosity function and / is the limiting fiux 
density of the survey. In the case of a fractal distribution with D < 2 it is easy 



to show that instead of Eq.^ we have p8 



ui9) ~ 9^-^ (48) 

so that the difference between the homogeneous and the fractal case is that 
in the first case the amplitude of the ACF depends on the sample depth 
while in the second case it is constant. Peebles claims that, since in the 
real angular catalogs one observes the scaling of the amplitude |0 and |63 



this provides an evidence against the fractal behavior. We show now that 
this conclusion is not correct because it does not take into account the finite 
size effects in real galaxy surveys, as it is the case of the GNC. In fact, the 



31 



Fig. 24. TliG ajiigiilctr correla/tion function for PcrusGS Pisces liniited. ctt ffiiiYn 

14., 5 

(ml45i) and 15.5 (ml55i). The points refers to different samples of the APM catalog 
scaled to the ACF of the Lick survey (Maddox et al, 1990). The scahng with depth 
of the amplitude of the angular correlation function in this case is due to a finite 
size effects and it is not a proof of homogeneity in space, as we know from the space 
analysis that this sample has fractal behavior up to its deeper depth. 

amplitude of the ACF is strongly related to the behaviour of the angular 
density, i.e. to A^(< m). 

Eq. ^ is obtained under the assumption that the density for a fractal scales 
as r~'^: this is true for the average conditional density in the case of an ideal 
fractal distribution, if the correct scaling regime is reached. The conditional 
density computed form a single point is instead strongly affected by finite size 
effects up to the characteristic minimal statistical length A. 

In order to illustrate this point we present the analysis of the ACF for the 
Perseus-Pisces redshift survey [0. In Fig.2l we show the behavior of the ACF 



(jj{0) for the whole ML survey. We can see that there is a clear scaling of the 
amplitude with the apparent magnitude limit of the survey {mum = 14.5, 15.5 
respectively). We know from the space analysis that the galaxy distribution 
is fractal in this sample and therefore this trend is not a consequence of ho- 
mogeneity, but only of the finite size effects that are especially large for the 
counting from the vertex. In summary the apparent homogeneity inferred from 
the angular catalogs has the same origin as the exponent a ~ 0.6 of the galaxy 
counts at bright magnitudes (small scales). Both arise from finite size effects 
(for a more detailed study on the ACF we refer the reader to 0]) and do not 
correspond to the real statistical properties. 

We note that the exponent of u{9) is also affected by the projection and 
somewhat higher than 1 — 7 0, corresponding to -D ~ 2 obtained in the 3d 
catalog. We find in fact that for 6' << 1 

uj{e) ~ e-^-' (49) 
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Fig. 25. Normalized differential sources counts at v = lAGHz. Abscissa log flux 
density (Jy). Ordinate log differential number of sources n{S). The solid line repre- 
sents the behaviour of a fractal structure with D = 1.8. The agreement is excellent, 
except in the bright fluxes region. 

as it is confirmed in various angular surveys ||3l]] and . The approximation 



of the Limber equation for small angles, that gives the exponent of the ACF is 
just 1 — 7, does not hold exactly, may be because we are close to the limiting 
case D ^ 2. 



Finally we stress that Park et ai, [^, analyzing the CfA2 redshift surveys, 
found the space two point correlation function has an index 7 1 correspond- 
ing to D ~ 2. The CfA2 surveys is just the three dimensional counterpart of 
the Zwicky angular catalog for which Groth & Peebles ||6^ found that the 
ACF, at small angles, scales as uj{d) ~ 0^° ''. This again shows the difficulties 
of recovering the genuine physical properties from the angular analysis alone. 



10 Other astrophysical data 



In Observational Astrophysics there are a lot of data that are only angular 
ones, as the measurements of distances is general a very complex task. We 
briefly discuss here the distribution of radiogalaxies, quasars and the 7-ray 
burst distribution. Our conclusion will be that all these data are compatible 
with a fractal structure with D ^ 1.6 ^ 1.8 similar to that of galaxies. 



The majority of catalogued radio sources are extragalactic and that some 
of the strongest are at cosmological distances. One of the most important 
information on radio galaxies distribution has been obtained from the sources 
counts as a function of the apparent flux [pH]. Extensive surveys of sources 



have been made at various frequencies in the range 0.4 5 GHz. In fig.^ we 



show a compendium of sources counts at u = l.AGHz [^. The differential 
counts is plotted against the apparent flux. We can see that in the bright flux 
region there is a deviation from a power law function, while for four decades 
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the agreement with a fractal distribution with D ^ 1.8 is excellent. Such 
a behaviour has been usually explained in literature as an effect of sources 
evolution. Here we propose that the radio galaxies are fractally distributed, 
as galaxies, with almost the same fractal dimension. The deviation at bright 
fluxes in this picture is explained as a spurious effect due to the small scale 
fluctuations, as in the case of galaxy counts. In the other frequency bands the 
situation is nearly the same (|^). The simple picture of a fractal distribution 
of radio sources is therefore fully compatible with the experimental situation. 

In the case of Quasars the situation is almost the same. In fact we find that 
at bright magnitudes (14.75 < B < 18.75) the exponent of the counts is 
a ~ 0.,88, while at faint magnitude it is a ~ 0.3 |Q. Even in this case we 
can interpret such a behavior as due to the fractal distribution in space with 
D ^ 1.5. 



Finally we comment the distribution of 7-ray burst (GRB). This is a long- 
standing problem in Astrophysics and after 20 years of intense studies and 
observations it is still mysterious: the origin of GRBs, in our galaxy or from 
the Cosmos, is a matter of debate [^] ||6^. We argue here that from 
the present angular and intensity data it is possible to show that the space 
distribution of 7-ray bursts sources is fully compatible with a fractal structure 
with D ^ 1.7. This result clarifies the statistical analysis of the available data 
and points out a fundamental aspect of the 7-ray bursts sources distribution. 



From the angular catalogs recent available 70 we have substantially it is 
possible to study three statistical quantities. The first one is the number versus 
apparent intensity distribution that shows a deviation from the Euclidean 
behaviour at low fluxes 
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[0|. The second is the V/Vmax test |j72| that again 
provides an evidence that the spatial distribution of sources is not homogenous 
|7T| [ [70| . Finally the angular distribution is isotropic within the statistical 
limits ||70|] [^] and there is not any evidence for an angular correlation or 
a clustering of bursts towards the galactic center or along the galactic plane 
of bursts fT^] |7^ [^]. These results together indicate that the bursts 
sources are distributed isotropic but not homogeneously |7D|. We argue here 
that these three evidences are fully compatible with a fractal distribution of 
sources with D ^ 1.7. 



The first observational fact is the number of burst as a function of the apparent 
flux N{> /). At bright apparent flux, that are associated to small distance of 
the sources, one sees an exponent —3/2, that seems to be in agreement with 
the homogeneous case. This is just a spurious effect that aries form the fact 
this quantity is computed without performing any average. At faint apparent 
fluxed, one is integrating the density in the correct scaling regime, and in this 
region the genuine statistical properties of the system can be detected. From 
the A^(> /) relation in the limit of low / we can estimate a fractal dimension 
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D ^ 1.7±0.1. 



An equivalent test on the homogeneity versus fractal properties is given by 
the V/Vmax distribution [|T2[, where 

V f C \~'^ 

(50) 



^max V Clim 

In this ratio V is the volume contained in a sphere extending to the location 
of the bursts and Vmax is the volume of the sphere extending to the maximum 
distance at which the same burst would be detectable by the instrument, 
whose limiting flux is Cum- It is simple to show that if the spatial distribution 
of sources is described a fractal structure, then we have for the average 



Even in this case the data show W^^ that < >= 0.33 ±0.01 that in terms 
of fractal dimension means D ^ 1.5 ± 0.1 This value is somewhat higher than 
the fractal dimension estimated with the A^(> /). This difference is probably 
due to the fact that this test is integrated while the A^(> /) is a differential 
one. 

Let us came to the third evidence, i.e. a substantially isotropic angular distri- 
bution and a lack of any correlation at small angles. As we have seen in the 
previous section, the projection of a fractal distribution on the unit sphere 
conserves the correlations properties only in the small angles approximation, 
while at large angular scale the long range correlation are destroyed by projec- 
tion effects. The angular correlation function oo{6) has a power law behaviour 
in the small angle approximation {6 < 10°). In the available sample [j70| the 



number of points is = 1122 distributed over the whole sky. This means 
that at angular distance smaller than ~ 20° one does not see any other object 
in average, and therefore it is not possible to study the angular correlation 
function at such low angular separation. 

These results indicate together that the 7-ray bursts Sources are fractally 
distributed in space with D 1.7. This is value is very similar to that the 
fractal dimension of the galaxy distribution is space that is D ~ 2 up to some 
hundreds megaparsec. This "coincidence" can be seen as an indication that 
the 7-ray bursts Sources are associate to a population of objects distributed 
as the visible galaxies. We stress that a larger sample of bursts will allow one 
a better determination of the fractal dimension and, if the number of objects 
for steradians will became larger, it will be possible to study also the angular 
correlations in the small angle approximation. 



35 



11 Discussion and conclusions 



Since the Hubble's pioneering work ||2^ tlie galaxy number counts (GNC) has 
been considered as a classical cosmological test and one of the most power- 
ful methods to study the galaxy space distribution as well as the effect of 
luminosity and space time evolution in different spectral regions ||28||-||42||. 



Moreover the apparent exponent a ~ 0.6 at small scale appears to be in 
contradiction with the fractal distribution derived from the full correlation 
analysis in the redshift catalogs. However the discussion of the behaviour of 
the GNC neglects the effect of the non-analyticity of fractal distributions. A 
fractal distribution is characterized by having fluctuations at all scales and, 
in dealing with real fractals characterized by a lower cut-off, a crucial point 
is the effect of finite size fluctuations. The lower cut-off can be defined by 
the Voronoi volume for a Poisson distribution. In case of galaxy surveys the 
Voronoi distance represents the minimal average distance between galaxies, 
and the condition that the survey volume should contain a large number of 
Voronoi volumes, defines the minimal statistical length A beyond which the 
statistical properties are detectable. 

A Poisson distribution is characterized by small amplitude fluctuations that 
represent small perturbations of a well defined average. A fractal distribution 
is characterized by fluctuations that can be as large as the system itself, so 
that the concept of average density looses its physical meaning. These fluctu- 
ations define all the statistical properties of the distribution. The qualitative 
difference between the fluctuations in these two cases is crucial to understand 
the importance of finite size effects. While for a homogeneous distribution one 
readily recovers the genuine statistical properties of the sample at distance, 
say, three times the Voronoi length, in the fractal case it is necessary to have 
a linear size of at least ten times that of the Voronoi volume. For averages 
(r(r), r*(r)) one recovers the correct properties at appreciably smaller length. 

The main problem of the standard analysis of the GNC is related to the fact 
that one computes A^(< m) only from one point, the origin. In this situation 
the fluctuations due to the finite size of the system dramatically affect the 
behaviour of the GNC to rather large scales. 

With the aim of clarifying this point, we have studied in detail the behaviour of 
the GNC in a redshift survey, the Perseus-Pisces, where it is possible to control 
the behaviour of the real space density. We find that the space distribution 
is well described by a fractal with dimension D ?5i 2 up to 130h~^ Mpc (0])- 
From the GNC analysis, we find that the exponent a ~ 0.6 is instead related to 
the presence of strong finite size fluctuations in the density n(r). These strong 
fluctuations affect the behaviour not only of n(r) but also of A^(< m) and 
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make the exponent of the GNC similar to the case of a homogeneous spatial 
distribution, i.e. a f» 0.6. This is because a very fluctuating behavior in the 
density can be roughly approximated by a constant density. In fact at small 
distances (r << A) one finds almost no galaxies. Then the number of galaxies 
start to grow but this regime is strongly affected by fluctuations. Finally the 
correct scaling region is reached for r ~ A, and the regime of raise and fall with 
strong fluctuations can be roughly approximated to a constant one. To show 
that this is indeed the case, we consider in detail the GNC is some VL samples 
of the Perseus-Pisces survey. In the deeper VL samples, where the spatial 
extension of the survey allows the self- averaging of the finite size fiuctuations 
(r > A) and it allows one to recover the smooth power law behavior for the 
density, the exponent of the GNC is simply a = D/5 ~ 0.4. If instead one 
computes the density only from one, the exponent is a ~ 0.6 because of 
the finite size fiuctuations. This happens also in the whole magnitude limit 
survey. The crucial point is that the exponent is a ~ 0.6 in this case is not 
associated to homogeneity in space, but it is due only to the presence of finite 
size fluctuations in the space distribution that do not self-average because the 
A^(< m) relation is computed only from one point, the origin. On the contrary 
if we perform an average from all the points of the survey for A'"(< m) we 
can readily recover the same exponent found with the spatial analysis (a = 
D/5 ~ 2). We have tested these results in some artificial catalogs with a priori 
assigned properties (fractal and homogeneous), finding clear confirmations for 
such a behavior. 

Having this result in mind we can clarify now the problem of the GNC be- 
haviour. In deeper optical samples it is found that a ~ 0.4. Now we can 

interpret this result as the genuine one of a large enough sample where, as 
in the case of the deeper VL samples, the finite size effects are averaged out 
for the presence of structures and voids in such a manner that the scaling 
region is reached and the self- averaging properties of the fractal structure are 
present. This situation can be described in a more quantitative way. We have 
introduced in Section 4 the minimal statistical length A that deflnes the length 
beyond which this behavior occurs. In particular this length is directly related 
to the lower cut-off of the galaxy distribution and to the solid angle of the 
survey. In Section 5 we have considered in detail the change of slope in ML 
surveys in the B-hand, that are the most significant from a statistical point 
of view. 

The crucial point that arises from our analysis is that for m > 19 the ML 
survey extends enough in the statistically significant region, i.e. r > A, for 
almost all the reasonable values of the solid angle of the surveys, so that the 
genuine behavior can be recovered also in this case, because the the finite size 
fluctuations due to the lower cut-off are averaged out. So we can relate the 
exponent found in the deep galaxy catalogs (m > 19) with what we flnd at 
small magnitude and redshift in the appropriate way, concluding that a ~ 0.4 
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is connected to the fractal distribution in space with dimension D 2 [58] 
that we observe at small redshifts and magnitudes (Fig.|l|). 

Moreover all the different values of the exponent of the GNC a found in 
different spectral regions are associated with the presence of these spatial 
fluctuations and are not intrinsic features of the galaxy luminosity distribution. 
For example, in i?-band the results are quite in agreement with those in the 
5-band as the exponent is a ~ 0.4 in both cases. In the i^'— band we find 
a ~ 0.67 in the range K < 11 and this behavior is clearly due to finite size 
fluctuations only. For K > 17 the exponent a ~ 0.3, and this can be probably 
related to the multifractal behavior of the luminous matter distribution 
or to a poor statistics. Only a three dimensional analysis allows one to decide 
between these two possibilities. 

Therefore our main conclusion is that the analysis of the GNC shows a clear 
and strong evidence that the fractal distribution seen in the correlation anal- 
ysis in local redshift surveys, with dimension D ^ 2, extends to the deepest 
depths so that we have a unique fractal structure in the magnitude range 
12 < m < 28 with D ^ 2, that is to say the largest scales ever probed for 
luminous matter. 



In addition we have shown that the counts of radio galaxies. Quasars and 
7 ray bursts are fully compatible with a fractal distribution in space with 
dimension D f» 1.7. The behavior of the real data for these objects looks like 
the case of galaxy counts. For radio galaxies and Quasars we can conclude 
that the genuine scaling behaviour can be found only at faint fluxes, and it is 
related to the fractal properties of the space distribution rather than to the 
effects of evolution of space time. In the case of 7 ray bursts our conclusion is 
that there is a strong evidence that the sources are associate to a population 
of objects distributed as the visible galaxies, as they have almost the same 
fractal dimension. 



The minimal statistical length , that is related to the fractal lower cut-offs 
and the solid angle of a survey has important consequences also from an 
experimental point of view. In fact, only beyond such a length one can find 
the correct scaling properties. This gives a quantitative criterion to define the 
statistical validity of a survey and for the optimization of its geometry in order 
to to derive the maximum reliable information. On this basis we can predict the 
expected statistical properties of several angular and redshift surveys (Table 



1, Fig.0 and Fig.^2D. Moreover from the knowledge of the lower cut-off, we 
can establish a quantitative condition to define a statistically fair sample, i.e. 
a sample that is large enough to allow one to recover the genuine statistical 
properties of the distribution. Such a condition is related to the number of 
points present in the sample to its space extension, and it depends explicitly 
on the lower cut-off of the fractal structure. 
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Finally have we considered the consequences of the finite size effects on the 
determination of the amplitude of the angular two point correlation function. 
Our analysis implies that this quantity is strongly affected by the presence of 
finite size effects, as it is determined from a single point, so that it suffers of 
the same problems of the GNC Indeed the amplitude of the GNC is directly 
related to the angular density, i.e. to A^(< m). As a consequence of these 
spurious effects one has the impression of an apparent homogenization shown 
by the scaling with depth of the amplitude of the ACF, in the small angles 
approximation. This scaling has been considered as an evidence for an 
homogeneous distribution of galaxies, while our results show that it arises 
only from finite size fiuctuations due to the presence of a lower cut-off in 
the galaxy distribution. This analysis shows again that the analysis of the 
angular properties for distribution characterized by long-range correlations is 
very subtle and requires a careful treatment. 

The result that the galaxy distribution can be described as fractal structure 
with D ^ 2 from small scales, up to the deepest scale ever probed for visible 
matter has dramatic consequences for the standard scenario [Q. At relatively 



small scales the evolutionary effects are certainly negligible. However the ob- 
servation of the same behavior in the entire range 12 < m < 28 raises a 
puzzling problem for the standard framework because one would expect at 
large scales some modifications due to the space-time evolution effects and 
the galaxy evolution mechanism. 
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